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<110> 

<120> 

<130> 

<160> 

<170> 

<210> 
<211> 
<212> 
<213> 



Novozymes A/s 
Protease Variants 
10508 
21 

Patentin version 3.2 



1 

1596 
DNA 

Nocardiopsis sp. 



NRRL 18262 ("Protease 10") 



<220> 
<221> 
<222> 

<220> 
<221> 
<222> 



CDS 

(318) . . (1463) 



sig_peptide 
(318). -(404) 



<220> 

<221> mat_peptide 

<222> (900) . . (1463) 

<400> 1 



acgtttggta 


cgggtaccgg 


tgtccgcatg 


tggccagaat 


gcccccttgc 


gacagggaac 


60 


ggattcggtc 


ggtagcgcat 


cgactccgac 


aaccgcgagg 


tggccgttcg 


cgtcgccacg 


120 


ttctgcgacc 


gtcatgcgac 


ccatcatcgg 


gtgaccccac 


cgagctctga 


atggtccacc 


180 


gttctgacgg 


tctttccctc 


accaaaacgt 


gcacctatgg 


ttaggacgtt 


gtttaccgaa 


240 


tgtctcggtg 


aacgacaggg 


gccggacggt 


attcggcccc 


gatcccccgt 


tgatcccccc 


300 


aggagagtag 


ggacccc atg 
Met 


1 cga ccc tec ccc gtt 
: Arg Pro ser Pro Val 

-190 


gtc tec gee ate gqt 
val ser Ala He Gly 

-185 


350 



acg gga gcg ctg gee ttc got ctg gcg ctg tec gqt ace ccg gqt 
Thr Gly Ala Leu Ala Phe Gly Leu Ala Leu ser Gly Thr Pro Gly 



-180 



-175 



-170 



gcc etc gcg gcc ace gga gcg etc ccc cag tea ccc aec ccg gag 

Ala Leu Ala Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Glu 

-165 -160 -155 

gcc gae gcg gtc tec atg cag gag gcg etc cag cge gac etc gac 

Ala Asp Ala Val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp 

-150 -145 -140 

ctg acc tec gcc gag gcc gag gag ctg ctg gee gcc cag gae aec 

Leu Thr Ser Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr 

-135 -130 -125 

gee ttc gag gtc gac gag gcc gcg gcc gag gee gcc ggg gac gee 

Ala Phe Glu val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala 

-120 -115 -110 

tac ggc ggc tec gtc ttc gac acc gag age ctg gaa ctg acc gtc ctg 

Tyr Gly Gly ser val Phe Asp Thr Glu Ser Leu Glu Leu Thr val Leu 



-105 



-100 



-95 
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395 



440 



485 



530 



575 



623 
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gtc acc gat gcc gcc gcg gtc gag gcc gtg gag gcc acc ggc gcc ggg 

val Thr Asp Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly 
-90 -85 -80 

acc gag ctg gtc tec tac ggc ate gac ggt etc gac gag ate gtc eag 

Thr Glu Leu Val Ser Tyr Gly lie Asp Gly Leu Asp Glu lie val Gin 

•^7^ -70 



ggt ggt ctg gee tac acc atg ggc ggc cgc tgt teg gtc ggc ttc gcg 
Gly Gly Leu Ala Tyr Thr Met Gly Gly Arg cys Ser Val Gly Phe Ala 
5 10 15 20 

gcc acc aac gcc gcc ggt eag cec ggg ttc gtc ace gee ggt eac tgc 
Ala Thr Asn Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His cys 

25 30 35 

ggc cgc gtg ggc ace eag gtg acc ate ggc aac ggc agg ggc gtc ttc 
Gly Arg Val Gly Thr Gin Val Thr lie Gly Asn Gly Arg Gly val Phe 

40 45 50 

gag eag tec gtc ttc ccc ggc aac gac gcg gcc ttc gtc cgc ggt aeg 
Glu Gin ser Val Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr 
55 60 65 



tac gee aeg gtc gee ggt eac aac eag gee cec ate ggc tec tec gtc 
Tyr Ala Thr Val Ala Gly His Asn Gin Ala Pro lie Gly ser Ser val 
85 90 95 100 



cgc ggc eag teg gtg age tac ccc gag ggc acc gtc ace aac atg ace 

Arg Gly Gin ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr 

120 125 130 

egg acc acc gtg tgc gcc gag cec ggc gac tec ggc ggc tec tac ate 

Arg Thr Thr Val Cys Ala Glu Pro Gly Asp ser Gly Gly Ser Tyr lie 
135 140 145 



cgc ace ggc ggg ace acc ttc tac eag gag gtc acc cec atg gtg aac 

Arg Thr Gly Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn 

165 170 175 180 
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671 



719 



gag etc aac gcc gee gac gcc gtt ccc ggt gtg gtc ggc tgg tac ccg 767 
Glu Leu Asn Ala Ala Asp Ala val Pro Gly Val val Gly Trp Tyr Pro 
-60 -55 -50 -45 

gac gtg gcg ggt gac acc gtc gtc ctg gag gtc ctg gag ggt tee gga 815 
Asp val Ala Gly Asp Thr Val Val Leu Glu val Leu Glu Gly ser Gly 

-40 -35 -30 

gee gac gtc age ggc ctg etc gcg gac gee ggc gtg gac gee teg gcc 863 
Ala Asp Val Ser Giy Leu Leu Ala Asp Ala Gly Vai Asp Ala Ser Ala 

-25 -20 -15 

gtc gag gtg ace aeg age gac eag ccc gag etc tac gcc gac ate ate 911 
val Glu val Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie He 
-10 -5 -1 1 



959 



1007 



1055 



1103 



tec aac ttc aeg ctg acc aac ctg gtc age cgc tac aac acc ggc ggg 1151 
Ser Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly 
70 75 80 



1199 



tgc cgc tee ggc tec acc acc ggt tgg eac tgt ggc ace ate eag gee 1247 
Cys Arg ser Gly Ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala 

105 110 115 



1295 



1343 



tec ggc acc eag gcc eag ggc gtg ace tec ggc ggc tec ggc aac tgc 1391 
Ser Gly Thr Gin Ala Gin Gly val Thr ser Gly Gly ser Gly Asn cys 
150 155 160 



1439 
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tec tgg gqc gtc cgt etc egg acc tgatccccge ggttccaggc ggaccgacgg 1493 
Ser Trp Gly val Arg Leu Arg Thr 

185 

tcgtgacctg agtaccagge gtccccgccg cttecagcgg cgtccgcacc ggggtgggac 1553 

egggcgtggc cacggcccca cccgtgaccg gaccgcccgg eta 1596 

<210> 2 
<211> 382 
<212> PRT 

<213> Noeardiopsis sp. NRRL 18262 ("Protease 10") 
<400> 2 

Met Arg Pro ser Pro Val Val ser Ala lie Gly Thr Gly Ala Leu 

-190 -185 -180 

Ala Phe Gly Leu Ala Leu ser Gly Thr Pro Gly Ala Leu Ala Ala 

-175 -170 -165 

Thr Gly Ala Leu Pro Gin ser Pro Thr pro Glu Ala Asp Ala val 

-160 -155 -150 

Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala 

-145 -140 -135 

Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val 

-130 -125 -120 

Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser 

-115 -110 -105 

Val Phe Asp Thr Glu ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala 

-100 -95 -90 

Ala Ala val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu val 

-85 -80 -75 

Ser Tyr Gly lie Asp Gly Leu Asp Glu lie val Gin Glu Leu Asn Ala 
-70 -65 -60 

Ala Asp Ala Val Pro Gly val Val Gly Trp Tyr Pro Asp Val Ala Gly 
-55 -50 -45 

Asp Thr val val Leu Glu Val Leu Glu Gly ser Gly Ala Asp val ser 
-40 -35 -30 -25 

Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala Val Glu val Thr 

-20 -15 -10 

Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala 

-5 -11 5 
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Tyr Thr Met Gly Gly Arg cys ser Val Gly phe Ala Ala Thr Asn Ala 
10 15 20 

Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val Gly 
25 30 35 40 

Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin Ser val 

45 50 55 

Phe pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe Thr 

60 65 70 

Leu Thr Asn Leu val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr val 
75 80 85 

Ala Gly His Asn Gin Ala Pro lie Gly ser ser val cys Arg ser Gly 
90 95 100 

ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin Ser 
105 110 115 120 

Val ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr Val 

125 130 135 

cys Ala Glu Pro Gly Asp Ser Gly Gly ser Tyr lie ser Gly Thr Gin 

140 145 150 

Ala Gin Gly val Thr ser Gly Gly ser Gly Asn Cys Arg Thr Gly Gly 
155 160 165 

Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn ser Trp Gly Val 
170 175 180 

Arg Leu Arg Thr 
185 

<210> 3 
<211> 1065 
<212> DNA 

<213> Nocardiopsis dassonvillei subspecies dassonvillei dsm 43235 ("Protease 
18 J 

<220> 

<221> CDS 

<222> (1) . . (1062) 

<220> 

<221> mat_peptide 
<222> (4995 . . (1062) 

<400> 3 

get ccg gcc ccc gtc ccc cag acc ccc gtc gcc gac gac age gcc 45 
Ala Pro Ala Pro Val Pro Gin Thr Pro val Ala Asp Asp Ser Ala 
-165 -160 -155 
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gcc age atg acc gag gcg etc aag cgc gac etc gac etc ace teg 90 
Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr ser 
-150 -145 -140 

gee gag gcc gag gag ctt etc teg gcg cag gaa gcc gcc ate gag 135 
Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu 
-135 -130 -125 

acc gac gcc gag gcc acc gag gcc gcg ggc gag gcc tac ggc ggc 180 
Thr Asp Ala Glu Ala Thr Glu Ala Ala Giy Glu Ala Tyr Gly Gly 
-120 -115 -110 

tea etg ttc gac acc gag acc etc gaa etc acc gtq ctg gtc acc gac 228 
Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp 
-105 -100 -95 

gcc tec gcc gtc gag gcg gtc gag gee ace gga gcc cag gee ace gtc 276 
Ala ser Ala val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr val 
-90 -85 -80 -75 



gtc tec cac ggc acc gag ggc ctg ace gag gtc gtg gag gac etc aac 

val Ser His Giy Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 

-70 -65 -60 

ggc gcc gag gtt cec gag age gtc etc ggc tgg tac ccg gac gtq gag 

Gly Ala Glu val Pro Glu Ser val Leu Gly Trp Tyr Pro Asp val Glu 

-55 -50 -45 

age gac ace gtc gtg gtc gag gtg ctg gag ggc tec gac gcc gac gtc 

ser Asp Thr val Val val Glu Val Leu Glu Gly ser Asp Ala Asp Val 

-40 -35 -30 



teg gtg acc ggt acc age cag gee ccg gcc ggc teg gcc gtg tgc cgc 
Ser Val Thr Gly Thr ser Gin Ala Pro Ala Giy ser Ala Val cys Arg 

90 95 100 



324 



372 



420 



gcc gee etg etc gee gac gee ggt gtg gac tec tec teg gtc egg gtg 468 
Ala Ala Leu Leu Ala Asp Ala Gly val Asp ser Ser ser Val Arg val 
-25 -20 -15 

gag gag gcc gag gag gcc ccg cag gtc tac gee gac ate ate ggc ggc 516 
Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp lie lie Gly Gly 
-10 -5 -11 5 

ctg gcc tac tac atg ggc ggc cgc tgc tec gtc ggc ttc gcc gcg acc 564 
Leu Ala Tyr Tyr Met Gly Gly Arg cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 

aac age gcc ggt cag cec ggt ttc gtc acc gee ggc cac tgc ggc ace 612 
Asn ser Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Thr 
25 30 35 

gtc ggc acc ggc gtg acc ate ggc aac ggc ace ggc ace ttc cag aac .660 
Val Gly Thr Gly val Thr lie Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 

teg gtc ttc cec ggc aac gac gcc gcc ttc gtc cgc ggc acc tec aac 708 
Ser Val Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr ser Asn 
55 60 65 70 

ttc acc etg acc aac ctg gtc teg cgc tac aac tec ggc ggc tac cag 756 
Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn ser Gly Gly Tyr Gin 

75 80 85 



804 



tec ggc tec acc acc ggc tgg cac tgc ggc acc ate cag gee cgc aac 852 
ser Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Asn 
105 110 115 
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cag acc gtg cgc tac ccg cag ggc acc gtc tac teg etc acc cgc acc 
Gin Thr val Arg Tyr Pro Gin Gly Thr val Tyr ser Leu Thr Arg Thr 
120 125 130 



900 



aac gtg tgc gcc gag ccc ggc gac tec ggc gqt teg tte ate tee ggc 948 
Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe lie Ser Gly 
135 140 145 150 

teg cag gcc cag ggc gtc acc tee ggc ggc tec ggc aac tgc tee gtc 996 
ser Gin Ala Gin Gly val Thr Ser Gly Gly ser Gly Asn Cys ser val 

155 160 165 

ggc ggc aeg acc tac tac cag gag gtc acc ccg atg ate aac tec tgg 1044 
Gly Gly Thr Thr Tyr Tyr Gin Glu Val Thr Pro Met lie Asn ser Trp 

170 175 180 

ggt gtc agg ate egg ace taa 1065 
Gly val Arg lie Arg Thr 
185 

<210> 4 
<211> 354 
<212> PRT 

<213> Nocardlopsis dassonvillei subspecies dassonvillei DSM 43235 ("Protease 
18") 

<400> 4 

Ala Pro Ala Pro Val Pro Gin Thr Pro val Ala Asp Asp ser Ala 
-165 -160 -155 

Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 

Ala Glu Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala He Glu 
-135 -130 -125 

Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 

ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp 
-105 -100 -95 

Ala Ser Ala Val Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 

Val Ser His Gly Thr Glu Gly Leu Thr Glu val val Glu Asp Leu Asn 

-70 -65 -60 

Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp val Glu 

-55 -50 -45 

Ser Asp Thr val val val Glu val Leu Glu Gly ser Asp Ala Asp val 
-40 -35 -30 

Ala Ala Leu Leu Ala Asp Ala Gly val Asp ser ser ser Val Arg val 
-25 -20 -15 
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Glu Glu Ala Glu Glu Ala Pro Gin val Tyr Ala Asp He lie Gly Gly 
-10 -5 -11 5 

Leu Ala Tyr Tyr Met Gly Gly Arg cys ser val Gly Phe Ala Ala Thr 

10 15 20 

Asn ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 
25 30 35 

val Gly Thr Gly val Thr He Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 

Ser Val Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn 
55 60 65 70 

Phe Thr Leu Thr Asn Leu val ser Arg Tyr Asn Ser Gly Gly Tyr Gin 

75 80 85 

ser Val Thr Gly Thr Ser Gin Ala Pro Ala Gly Ser Ala val Cys Arg 

90 95 100 

ser Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Asn 
105 110 115 

Gin Thr val Arg Tyr Pro Gin Gly Thr val Tyr Ser Leu Thr Arg Thr 
120 125 130 

Asn val cys Ala Glu Pro Gly Asp Ser Gly Gly ser Phe He Ser Gly 
135 140 145 150 

Ser Gin Ala Gin Gly val Thr Ser Gly Gly Ser Gly Asn cys ser val 

155 160 165 

Gly Gly Thr Thr Tyr Tyr Gin Glu val Thr Pro Met lie Asn ser Trp 

170 175 180 

Gly Val Arg lie Arg Thr 
185 

<210> 5 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina DSM 15648 ("Protease 11") 

<220> 

<221> CDS 

<222> CD . . (1059) 

<220> 

<221> mat_peptide 
<222> (496) . . (1059) 

Page 7 



wo 2005/035747 PCT/DK2004/000688 

10508. 204- WO. ST2 5 

<400> 5 

gcc acc gqa ccg etc ccc cag tea eee aee ccg gag gee gae gee 45 
Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

gtc tee atg cag gag gcg etc cag cgc gac etc ggc etg ace ccg 90 
val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr pro 
-150 -145 -140 

ett gag gcc gat gaa etg etg gee gcc cag gae ace gcc tte gag 135 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-1^*; -i^n 



gtc gac gag gee gcg gee gcg gee gcc ggg gac gee tac ggc ggc 
val Asp Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



age ggc etg etc gcc gac gee ggc gtg gac gee teg gcc gtc gag gtg 
ser GJy Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu Val 
-25 -20 -15 -10 



gee tac aee atg ggc ggc cgc tgt teg gtc gga tte gcg gee ace aac 
Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala Thr Asn 
10 15 20 



180 



tee gtc tte gac ace gag ace etg gaa etg ace gtc etg gtc aee gae 228 
ser val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu Val Thr Asp 

-105 -100 -95 -90 

gee gcc teg gtc gag get gtg gag gee aee ggc gcg ggt ace gaa etc 276 

Ala Ala ser val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

gtc tec tac ggc ate gag ggc etc gae gag ate ate cag gat etc aac 324 

Val Ser Tyr Gly lie Glu Gly Leu Asp Glu lie lie Gin Asp Leu Asn 

-70 -65 -60 

gcc gcc gac gee gtc ccc ggc gtg gtc ggc tgg tac ccg gac gtg gcg 372 

Ala Ala Asp Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala 

-55 -50 -45 

ggt gac ace gtc gtc etg gag gtc etg gag ggt tec gga gee gac gtg 420 

Gly Asp Thr val val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp VaT 

-40 -35 -30 



468 



acc age agt gcg cag ccc gag etc tac gee gae ate ate ggc ggt etg 516 
Thr Ser Ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 



564 



gee gee ggt cag cee gga tte gtc aee gee ggt eac tgt ggc cgc gtg 612 
Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly Arg Val 
25 30 35 

ggc acc cag gtg age ate ggc aac ggc cag ggc gtc tte gag cag tec 660 
Gly Thr Gin Val Ser He Gly Asn Gly Gin Gly val Phe Glu Gin Ser 
40 45 50 55 

ate tte ccg ggc aac gae gee gee tte gtc cgc ggc acg tec aac tte 708 
lie Phe Pro Gly Asn Asp Ala Ala Phe val Arg Giy Thr ser Asn Phe 

60 65 70 

acg etg ace aac etg gtc age cgc tac aac ace ggc ggt tac gee acc 756 
Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 

gtc gee ggc eac aac cag gcg cee ate ggc tec tec gtc tgc cgc tec 804 
Val Ala Gly His Asn Gin Ala Pro lie Gly ser Ser Val cys Arg ser 
90 95 100 
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gqc tec acc acc ggc tgg cac tgc ggc acc ate cag gee cgc ggc cag 852 
Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 

teg gtg age tac cec gag ggc acc gtc ace aae atg ace egg ace acc 900 
Ser Val Ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

gtg tgc gee gag cec ggc gac tec ggc ggc tec tac ate tee ggc aac 948 
val Cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr lie Ser Gly Asn 

140 145 150 

eag gee cag ggc gtc acc tec ggc ggc tec ggc aac tgc cgc ace ggc 996 
Gin Ala Gin Gly val Thr ser Gly Gly ser Gly Asn cys Arg Thr Gly 

155 160 165 

ggg acc ace tte tac eag gag gtc acc cec atg gtg aae tec tgg ggc 1044 
Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 

gtc cgt etc egg acc taa 1062 
val Arg Leu Arg Thr 
185 

<210> 6 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15648 ("Protease 11") 

<400> 6 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 

Val Asp Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 

« 

ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 

Ala Ala ser val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

val Ser Tyr Gly lie Glu Gly Leu Asp Glu lie lie Gin Asp Leu Asn 

-70 -65 -60 

Ala Ala Asp Ala Val Pro Gly Val val Gly Trp Tyr Pro Asp val Ala 
-55 -50 -45 

Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly ser Gly Ala Asp val 
-40 -35 -30 
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Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 

Thr ser ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 

Ala Tyr Thr Met Gly Gly Arg Cys ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly Arg val 
25 30 35 

Gly Thr Gin Val ser lie Gly Asn Gly Gin Gly val Phe Glu Gin ser 
40 45 50 55 

He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr ser Asn Phe 

60 65 70 

Thr Leu Thr Asn Leu Val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 

Val Ala Gly His Asn Gin Ala Pro lie Gly Ser ser val cys Arg ser 
90 95 100 

Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 

ser val ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

Val Cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr He Ser Gly Asn 

140 145 150 

Gin Ala Gin Gly val Thr ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 

Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn ser Trp Gly 
170 175 180 

Val Arg Leu Arg Thr 
185 

<210> 7 

<211> 1062 

<212> DNA 

<213> Nocardiopsis praslna DSM 15649 ("Protease 35") 
<220> 

<221> CDS 

<222> CD - . C1059) 

<220> 
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<221> mat_pepti de 
<222> (496) . . C1059) 

<400> 7 

gcc acc gga cca etc ccc cag tea ccc acc ccg gag gcc gac gcc 45 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

gtc tec atg cag gag gcg etc cag egc gac etc ggc ctg acc ccg 90 

val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr pro 
-150 -145 -140 

ett gag gee gat gaa ctg ctg gee gee cag gac acc gcc ttc gag 135 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 

gtc gac gag gcc gcg gcc gag gcc gcc got gac gcc tac ggc ggc 180 

Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 

tec gtc ttc gac acc gag acc ctg gaa ctg acc gtc ctg gtc acc gac 228 

ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp 
-105 -100 -95 -90 

tec gcc gcg gtc gag gcg gtg gag gcc acc ggc gcc ggg acc gaa ctg 276 
ser Ala Ala val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

ate gtc gag gag etc aac 324 

Tlck /Till nltt Ley Asn 



gtc tec tac ggc ate acg ggc etc gac gag ate gtc gag gag 

Val Ser Tyr Gly lie Thr Gly Leu Asp Glu lie Val Glu Glu 

-70 -65 -60 

gee gcc gac gcc gtt ccc ggc gtg gtc ggc tgg tac ccg gac gtc gcg 372 

Ala Ala Asp Ala Val Pro Gly vat val Gly Trp Tyr Pro Asp val Ala 
-55 -50 -45 

ggt gac ace gtc gtg ctg gag gtc ctg gag ggt tec ggc gcc gac gtg 420 

ly Asp Thr val Val Leu Glu Val Leu Glu Gly Ser Gl^ ^ 



Gly Asp Thr val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp val 
-40 -35 -30 

ggc ggc ctg etc gcc gac gcc ggc gtg gac gee teg gcg gtc gag gtg 

Gly Gly Leu Leu Ala Asp Ala Gly val Asp Ala Ser Ala val Glu Val 

-25 -20 -15 -10 



Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala Thr Asn 
10 15 20 

gee gee ggt cag ccc ggg ttc gtc ace gcc ggt cac tgt ggc egc gtg 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Arg val 

25 30 35 

ggc acc cag gtg acc ate ggc aac ggc egg ggc gtc ttc gag cag tec 

Gly Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin ser 
40 45 50 55 



468 



acc acc acc gag cag ccc gag ctg tac gcc gac ate ate ggc ggt ctg 516 

Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 

gcc tac acc atg ggc ggc egc tgt teg gtc ggc ttc gcg gee acc aac 564 

ly Gly Arg cys ser val Gl^ 



612 



660 



ate ttc ccg ggc aac gac gcc gcc ttc gtc egc gga acg tec aac ttc 708 
lie Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn Phe 

60 65 70 

acg ctg acc aac ctg gtc age egc tac aac ace ggc ggc tac gcc ace 756 
Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 
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qtc gcc ggt cac aac cag gcg ccc ate ggc tec tec gtc tgc cgc tec 
Val Ala Gly His Asn Gin Ala Pro lie Gly ser sen val cys Arg Ser 
90 95 100 

gge tec ace ace ggt tgg cac tgc ggc ace ate cag gcc cgc ggc cag 
Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 

teg gtg age tac ccc gag ggc ace gtc ace aac atg acg egg acc ace 
ser val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

gtg tgc gcc gag ccc ggc gac tec ggc ggc tec tac ate tee ggc aac 
Val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr He Ser Gly Asn 

140 145 150 

cag gcc cag ggc gtc acc tec ggc ggc tec ggc aac tgc cgc ace ggc 
Gin Ala Gin Gly val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 

ggg ace ace ttc tac cag gag gtc ace ccc atg gtg aac tee tgg ggc 
Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 



gtc egt etc egg ace taa 
Val Arg Leu Arg Thr 



185 

<210> 8 

<211> 353 

<212> PRT 

<213> Noeardiopsis prasina DSM 15649 ("Protease 35") 

<400> 8 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 

val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 

Ser val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp 
-105 -100 -95 -90 

Ser Ala Ala val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

Val Ser Tyr Gly lie Thr Gly Leu Asp Glu He Val Glu Glu Leu Asn 

-70 -65 -60 

Ala Ala Asp Ala val Pro Gly val val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 



804 



852 



900 



948 



996 



1044 



1062 
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Gly Asp Thr val Val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 
-40 -35 -30 

Gly Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu val 
-25 -20 -15 -10 

Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 

Ala Tyr Thr Met Gly Gly Arg cys Ser val Gly Phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly Arg val 
25 30 35 

Gly Thr Gin Val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin ser 
40 45 50 55 

lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 

60 65 70 

Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 

val Ala Gly His Asn Gin Ala Pro lie Gly ser Ser Val Cys Arg ser 
90 95 100 

Gly ser Thr Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Gly Gin 
105 110 115 

ser val ser Tyr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

val Cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr lie ser Gly Asn 

140 145 150 

Gin Ala Gin Gly Val Thr Ser Gly Gly ser Gly Asn cys Arg Thr Gly 

155 160 165 

Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met val Asn Ser Trp Gly 
170 175 180 

Val Arg Leu Arg Thr 
185 

<210> 9 

<211> 1068 

<212> DNA 

<213> Nocardiopsis alba DSM 15647 C"Protease 08") 
<220> 

<221> CDS 
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<222> CD . . C1065) 
<220> 

<221> mat_peptide 
<222> (502). . (1065) 

<400> 9 

gcg acc gqc ccc etc ccc cag tec ccc acc ccg gat gaa gcc gag 45 

Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Asp Glu Ala Glu 
-165 -160 "155 

gcc acc acc atg gtc gag gcc etc cag cgc gac etc ggc ctg tec 90 
Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 
-150 -145 -140 

ccc tet cag gee gac gag etc etc gag gcg cag gee gag tec ttc 135 
Pro ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu ser Phe 
-135 -130 -125 

gag ate gac gag gcc gee acc gcg gcc gea gcc gac tec tac ggc 180 
Glu He Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp ser Tyr Gly 
-120 -115 -110 

ggc tec ate ttc gac ace gac age etc acc ctg ace gtc ctg gtc ace 228 
Gly Ser lie Phe Asp Thr Asp Ser Leu Thr Leu Thr val Leu val Thr 
-105 -100 -95 

gac gcc tee gee gtc gag gcg gtc gag gcc gee ggc gee gag gcc aag 276 
Asp Ala Ser Ala val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys 
-90 -85 -80 

gtg gtc teg cae ggc atg gag ggc ctg gag gag ate gtc gee gac ctg 324 
Val val ser His Gly Met Glu Gly Leu Glu Glu He Val Ala Asp Leu 
-75 -70 -65 -60 



aac gcg gcc gac get cag ccc ggc gtc gtg ggc tgg tac eee gac ate 

Asn Ala Ala Asp Ala Gin Pro Gly val VaT Gly Trp Tyr Pro Asp He 

-55 -50 -45 

cac tee gac aeg gtc gtc etc gag gtc etc gag ggc tee gqt gee gac 

His ser Asp Thr Val Val Leu Glu Val Leu Glu Gly ser Gly Ala Asp 

-40 -35 -30 



acc aac gcc tee ggc cag ccc ggg ttc gtc ace gee ggc cae tge ggc 
Thr Asn Ala ser Gly Gin Pro Gly Phe val Thr Ala Gly His Cys Gly 

25 30 35 



372 



420 



gtg gac tec ctg etc gcc gac gee ggt gtg gac acc gee gac gtc aag 468 
val Asp Ser Leu Leu Ala Asp Ala Gly VaT Asp Thr Ala Asp val Lys 
-25 -20 -15 

gtg gag age acc ace gag cag ccc gag ctg tac gcc gac ate ate ggc 516 
Val Glu Ser Thr Thr Glu Gin pro Glu Leu Tyr Ala Asp He lie Gly 
-10 -5 -11 5 

ggt etc gee tac ace atg ggt ggg cgc tge teg gtc ggc ttc gcg gee 564 
Gly Leu Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala 

10 15 20 



612 



ace gtc ggc acc ccg gtc age ate ggc aac ggc cag ggc gtc ttc gag 660 

Thr val Gly Thr pro val ser lie Gly Asn Gly Gin Gly val Phe Glu 
40 45 50 

cgt tec gtc ttc ccc ggc aac gac tec gee ttc gtc cgc ggc acc teg 708 

Arg ser Val Phe Pro Gly Asn Asp ser Ala Phe Val Arg Gly Thr Ser 
55 60 65 

aac ttc acc ctg acc aac ctg gtc age cgc tac aac ace ggt ggt tac 756 
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Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 
70 75 80 85 

gcg acc gtc tec ggc tec teg eag gcg gcg ate ggc teg cag ate tgc 

Ala Thr val ser Gly ser ser Gin Ala Ala lie Gly Ser Gin lie Cys 

90 95 100 

egt tec ggc tec ace ace ggc tgg eac tgc ggc acc gtc cag gee cge 

Arg Ser Gly ser Thr Thr Gly Trp His Cys Gly Thr val Gin Ala Arg 

105 110 115 

ggc cag aeg gtg age tac cec eag ggc acc gtg cag aac etg ace egc 

Gly Gin Thr val Ser Tyr Pro Gin Gly Thr val Gin Asn Leu Thr Arg 
120 125 130 

ace aac gtc tgc gee gag cec ggt gac tec ggc gge tec tte ate tee 

Thr Asn Val Cys Ala Glu Pro Gly Asp ser Gly Gly Ser Phe lie Ser 

135 140 145 



804 



852 



900 



948 



gge age eag gee eag ggc gtc acc tec ggt ggc tee ggc aac tgc tec 996 
Gly ser Gin Ala Gin Gly Val Thr Ser Gly Gly ser Gly Asn Cys ser 
150 155 160 165 

tte ggt gge acc ace tac tac cag gag gtc aac ccg atg etg age age 1044 
Phe Gly Gly Thr Thr Tyr Tyr Gin Glu Val Asn Pro Met Leu ser Ser 

170 175 180 

tgg ggt etg acc etg cge acc tga 1068 
Trp Gly Leu Thr Leu Arg Thr 

185 

<210> 10 
<211> 355 
<212> PRT 

<213> Nocardiopsis alba DSM 15647 ("Protease 08") 
<400> 10 

Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Asp Glu Ala Glu 
-165 -160 -155 

Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu ser 
-150 -145 -140 

Pro Ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu ser Phe 
-135 -130 -125 

Glu lie Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp ser Tyr Gly 
-120 -115 -110 

Gly Ser He Phe Asp Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr 
-105 -100 -95 

Asp Ala ser Ala val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys 
-90 -85 -80 

Val val Ser His Gly Met Glu Gly Leu Glu Glu lie val Ala Asp Leu 
-75 -70 -65 -60 



Asn Ala Ala Asp Ala Gin Pro Gly val val Gly Trp Tyr Pro Asp lie 
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His ser ASP Thr Val Val Leu Glu val Leu Glu Gly ser Gly Ala Asp 

-40 -35 -30 

Val ASP ser Leu Leu Ala Asp Ala Gly val Asp Thr Ala Asp val Lys 
-25 -20 -15 

Val Glu ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp He He Gly 
-10 -5 -11 5 

Gly Leu Ala Tyr Thr Met Gly Gly Arg cys ser val Gly Phe Ala Ala 

10 15 20 

Thr Asn Ala Ser Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly 

25 30 35 

Thr val Gly Thr Pro Val ser lie Gly Asn Gly Gin Gly val Phe Glu 
40 45 50 

Arg ser val Phe Pro Gly Asn Asp Ser Ala Phe val Arg Gly Thr ser 
55 60 65 

Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 
70 75 80 85 

Ala Thr val ser Gly ser ser Gin Ala Ala He Gly ser Gin lie Cys 

90 95 100 

Arg ser Gly ser Thr Thr Gly Trp His cys Gly Thr val Gin Ala Arg 

105 110 115 

Giv Gin Thr val Ser Tyr Pro Gin Gly Thr Val Gin Asn Leu Thr Arg 
120 125 130 

Thr Asn val cys Ala Glu Pro Gly Asp ser Gly Gly Ser Phe He Ser 
135 140 145 

Gly ser Gin Ala Gin Gly Val Thr Ser Gly Gly ser Gly Asn Cys Ser 
150 155 160 165 

Phe Giv Gly Thr Thr Tyr Tyr Gin Glu val Asn Pro Met Leu ser ser 

170 175 180 

Trp Gly Leu Thr Leu Arg Thr 

185 

<210> 11 
<211> 27 
<212> DNA 
<213> Arti f i ci al 
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<220> 

<223> Primer 
<400> 11 

ccgattatgg agcggattga acatgcg 



27 



<210> 
<211> 
<212> 
<213> 



12 
31 
DNA 

Artificial 



<220> 

<223> Primer 
<400> 12 

gtgaccatcg gcgacggcag gggcgtcttc g 



31 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



13 

10172 
DNA 

Arti f i ci a1 



Expression cassette 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 



source 

(1).,C3323) . _ 

Bacillus subtilis genome sequence including 
yfmH-yfmD-yfmc-yfmB-yfmA genes 

misc^recomb 

(3561). -(4208) . . n 

cat gene providing chloramphenicol resistance 

promoter 

C4523) . . (5633) l -i • • 

Triple PamyL-scBAN-crylilA promoter including mRNA stabilizing 

sequence 

sig-peptide 
(5658) -.(5738) 



CDS 

(5658).. (6797) 



mat^peptide 
(6234) . . (6797) 



source 

(6839). .(7540) . 

Part of Bacillus subtilis pectate lyase gene 



source 

Bacillus subtilis genome DNA inclding yflS-citM genes 



<400> 13 
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gagtatcgcc 


agtaaggggc gtttttgttt 


tctggttgtt 


ttcttcattt 


caggtttcgc 


60 


cctttccttg 


ccaaatataa gaaaaacggc 


gttccgataa 


tcgcggtgac 


aatgccgacc 


120 


ggtgattcat 


aaggaaatgc aatccatctg 


gccagaacat 


ctgcgtacac 


cagcaaaatg 


180 


gcaccgaaca 


gtgccgaaaa cggaagcacg 


tattgataat 


gttctccgat 


cagcttgcgg 


240 


acaatatgcg 


ggacgagcag cccgacaaag 


ccaatcggcc 


cggcgacggc 


tacggaagcg 


300 


ccggaaagaa 


ttaaaataat caaactgatc agaatcctga tgccgttcat attttgtcca 


360 


agcccttttg 


ctgtttcgtc tccgagaccg 


agaacagaaa 


cagaaccgga 


aaatacgagg 


420 


gcaagcccga 


tgccaatgac agaaaaagga gcgatggtta tgacgtcctg ccagttgctg 


480 


ccgtcgattg 


cgcctgtcat ccagtacaga 


acatcctcac 


ctgactcatt 


taaaataatg 


540 


atggcctgtg 


tcatagagga gaggaacaag 


tgcacggcca 


ttcctgacag 


cgccagcttg 


600 


acaggcgtca ttccgccgga tgaggcaatc 


atatacacaa 


tcgcgccgcc 


tgctgccgca 


660 


cccgcaaaag 


cgaatataac agatgaatag 


ggcgatgccg 


gcagaatgac 


gagagaagca 


720 


acaacaaaaa 


gcgatgcacc cgcattcaca ccgaaaattt ggggtgaagc cagaggattt 


780 


ctggtcatag 


cctgcatcag cgcccctgct 


acagctaggc 


tggcgccgac 


aaaaacgccg 


840 


attaatgtgc 


ggggaaggcg aagagtagag 


atgatgagct 


gttcctttga accgtcccat 


900 


acaaaaagat 


atttcaatga atctatgatg 


ctgatgtctg 


aggctcctac 


tgaaagattc 


960 


agcccaagcc 


caaatataaa aataatcagt 


gcaatgataa 


acatcatcag 


tcttgatgat 


1020 


gagcgccgtt tggctgaatg atacaacagt 


ctcacttcct 


tactgcgtct 


ggttgcaaaa 


1080 


acgaagaagc aaggattccc ctcgcttctc atttgtccta 


tttattatac 


acttttttaa 


1140 


gcacatcttt 


ggcgcttgtt tcactagact 


tgatgcctct 


gaatcttgtc 


caagtgtcac 


1200 


ggtccgcatc atagacttgt ccatttttca 


ccgctttgag 


atttttccag 


agcgggttcg 


1260 


ttttccactc atctacaatg gttttgcctt 


cgttggctga 


gatgaacaaa 


atatcaggat 


1320 


cgattttgct 


caattgctca aggctgacct 


cttgataggc 


gttatctgac 


ttcacagcgt 


1380 


gtgtaaagcc 


tagcatttta aagatttctc 


cgtcatagga 


tgatgatgta 


tgaagctgga 


1440 


aggaatccgc 


tcttgcaacg ccgagaacga 


tgttgcggtt 


ttcatctttc 


ggaagttcgg 


1500 


cttttagatc 


gttgatgact tttttgtgct 


cggcaagctt 


ttcttttcct 


tcatcttctt 


1560 


tatttaatgc 


tttagcaatg gtcgtaaagc 


tgtcgatcgt 


ttcgtcatat 


gtcgcttcac 


1620 


ggctttttaa ttcaatcgtc ggggcgattt ttttcagctg tttataaatg 


tttttatggc 


1680 


gctcagcgtc 


agcgatgatt aaatcaggct 


tcaaggaact 


gatgacctca 


agattgggtt 


1740 


cgctgcgtgt 


gcctacagat gtgtaatcaa 


tggagctgcc 


gacaagcttt 


ttaatcatat 


1800 


cttttttgtt 


gtcatctgcg atgcccaccg 


gcgtaatgcc 


gagattgtga 


acggcatcca 


1860 


agaatgaaag 


ctcaagcaca accacccgct 


^aggtgtgcc 


gcttactgtc 


gtttttcctt 


1920 


cttcgtcatg 


gatcactctg gaatccttag 


actcgctttt 


gccgcttccg 


ttgttattct 


1980 


ggcttgatga acagccggat acaatgaggc aggcgagcaa 


taaaacactc 


atgatggcaa 


2040 
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tcaacttgtt 


agaataggtg 


cgcatgtcat 


tcttcctttt 


ttcagattta 


gtaatgagaa 


2100 


tcattatcac 


atgtaacact 


ataatagcat 


ggcttatcat 


gtcaatattt 


ttttagtaaa 


2160 


gaaagctgcg 


tttttactgc 


tttctcatga 


aagcatcatc 


agacacaaat 


aagtggtatg 


2220 


cagcgttacc 


gtgtcttcga 


gacaaaaacg 


catgggcgtt 


ggctttagag 


gtttcgaaca 


2280 


tatcagcagt 


gacataagga 


aggagagtgc 


tgagataacc 


ggacaatttc 


ttttctattt 


2340 


catctgttag 


tgcaaattca 


atgtcgccga 


tattcatgat 


aatcgagaaa 


acaaagtcga 


2400 


tatcgatatg 


aaaatgttcc 


tcggcaaaaa 


ccgcaagctc 


gtgaattcct 


ggtgaacatc 


2460 


cggcacgctt 


atggaaaatc 


tgtttgacta 


aatcactcac 


aatccaagca 


ttgtattgct 


2520 


gttctggtga 


aaagtattgc 


attagacata 


cctcctgctc 


gtacggataa 


aggcagcgtt 


2580 


tcatggtcgt 


gtgctccgtg 


cagcggcttc 


tccttaattt 


tgatttttct 


gaaaataggt 


2640 


cccgttccta 


tcactttacc 


atggacggaa 


aacaaatagc 


tactaccatt 


cctcctgttt 


2700 


ttctcttcaa 


tgttctggaa 


tctgtttcag 


gtacagacga 


tcgggtatga 


aagaaatata 


2760 


gaaaacatga 


aggaggaata 


tcgacatgaa 


accagttgta 


aaagagtata 


caaatgacga 


2820 


acagctcatg 


aaagatgtag 


aggaattgca 


gaaaatgggt 


gttgcgaaag 


aggatgtata 


2880 


cgtcttagct 


cacgacgatg 


acagaacgga 


acgcctggct 


gacaacacga 


acgccaacac 


2940 


gatcggagcc 


aaagaaacag 


gtttcaagca 


cgcggtggga 


aatatcttca 


ataaaaaagg 


3000 


agacgagctc 


cgcaataaaa 


ttcacgaaat 


cggtttttct 


gaagatgaag 


ccgctcaatt 


3060 


tgaaaaacgc 


ttagatgaag 


gaaaagtgct 


tctctttgtg 


acagataacg 


aaaaagtgaa 


3120 


agcttgggca 


taaagcaagg 


aaaaaaccaa 


aaggccaatg 


tcggcctttt 


ggtttttttg 


3180 


cggtctttgc 


ggtgggattt 


tgcagaatgc 


cgcaatagga 


tagcggaaca 


ttttcggttc 


3240 


tgaatgtccc 


tcaatttgct 


attatatttt 


tgtgataaat 


tggaataaaa 


tctcacaaaa 


3300 


tagaaaatgg 


gggtacatag 


tggccatcat 


ggccagctag 


catgcacatg 


ggatctggga 


3360 


ccaataataa 


tgactagaga 


agaaagaatg 


aagattgttc 


atgaaattaa 


ggaacgaata 


3420 


ttggataaag 


tggggtattt 


ttaaaatata 


tatttatgtt 


acagtaatat 


tgacttttaa 


3480 


aaaaggattg 


attctaagaa 


gaaagcagac 


aagtaagcct 


cctaaattca 


ctttagataa 


3540 


aaatttagga 


ggcatatcaa 


atgaacttta 


ataaaattga 


tttagacaat 


tggaagagaa 


3600 


aagagatatt 


taatcattat 


ttgaaccaac 


aaacgacttt 


tagtataacc 


acagaaattg 


3660 


atattagtgt 


tttataccga 


aacataaaac 


aagaaggata 


taaattttac 


cctgcattta 


3720 


ttttcttagt 


gacaagggtg 


ataaactcaa 


atacagcttt 


tagaactggt 


tacaatagcg 


3780 


acggagagtt 


aggttattgg 


gataagttag 


agccacttta 


tacaattttt 


gatggtgtat 


3840 


ctaaaacatt 


ctctggtatt 


tggactcctg 


taaagaatga 


cttcaaagag 


ttttatgatt 


3900 


tatacctttc 


tgatgtagag 


aaatataatg 


gttcggggaa 


attgtttccc 


aaaacaccta 


3960 


tacctgaaaa 


tgctttttct 


ctttctatta 


ttccatggac 


ttcatttact 


gggtttaact 


4020 


taaatatcaa 


taataatagt 


aattaccttc 


tacccattat 


tacagcagga 


aaattcatta 


4080 
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ataaaggtaa 


ttcaatatat 


10508 . 204-WO 
ttaccgctat ctttacaggt 


.ST25 

acatcattct 


y Li. eg Lgaxg 




gttatcatgc 


aggattgttt 


atgaactcta 


ttcaggaatt 


gtcagatagg 


w.cx.aa tgac u 




ggcttttata 


atatgagata 


atgccgactg tactttttac agtcggtttt 


ciiaacgaxac 




attaataggt 


acgaaaaagc 


aacttttttt 


gcgcttaaaa 


ccagtcatac 


caanaacLxa 


VI t> A 


agggtaacta 


gcctcgccgg 


aaagagcgaa 


aatgcctcac 


atttgtgcca 


cctaaaaagg 




agcgatttac 


atatgagtta 


tgcagtttgt agaatgcaaa aagtgaaatc 


agctggaciia 




aaaggcatgg 


catgccttcg 


atagtttatt 


aatattagtg 


gagctcagtg 


agagcgaagc 


*rDUU 


gaacacttga 


ttttttaatt 


ttctatcttt 


tataggtcat 


tagagtatac 


L xa Liixg tcc 


*tDDU 


tataaactat 


ttagcagcat 


aatagattta 


ttgaataggt 


catttaagtt 


gagcax-ai. ca 


HD^ vr 


ggggaggaaa 


atcttggaga 


aatatttgaa 


gaacccgagg 


atctagatca 


ggxaccgcaa 


*tDOU 


cgttcgcaga 


tgctgctgaa 


gagattatta 


aaaagctgaa 


agcaaaaggc 


xaTicaa L Lgg 


A7An 


taactgtatc 


tcagcttgaa 


gaagtgaaga 


agcagagagg 


ctattgaata 


aaugagxaga 


*foUU 


aagcgccata 


tcggcgcttt 


tcttttggaa 


gaaaatatag 


ggaaaatggt 


ac L xg xnaaa 




aattcggaat 


atttatacaa 


tatcatatgt 


atcacattga 


aaggaggggc 


c xgc LyTicca 




gactgtccgc 


tgtgtaaaaa 


aaaggaataa 


aggggggttg 


acattatttt 


ac LgauaLy u 




ataatataat 


ttgtataaga 


aaatggaggg 


gccctcgaaa 


cgtaagatga 


aaccu Lagax 


DU4U 


aaaagtgctt 


tttttgttgc 


aattgaagaa 


ttattaatgt 


taagcttaat 


xaaagaxaax 


no 


atctttgaat 


tgtaacgccc 


ctcaaaagta 


agaactacaa 


aaaaagaata 


eg Lcaxaxag 


!>XDU 


aaatatgttt 


gaaccttctt 


cagattacaa 


atatattcgg 


acggactcta 


ccTcaaaiigc 




ttatctaact 


atagaatgac 


atacaagcac 


aaccttgaaa 


atttgaaaat 


ataacxacca 




atgaacttgt 


tcatgtgaat 


tatcgctgta 


tttaattttc 


tcaattcaat 


axaxaaxa xy 




ccaatacatt 


gttacaagta 


gaaattaaga 


cacccttgat 


agccttacta 


■4- ^ /~ ^ +• a a ^ a +" 


J *T V V 


gatgtagtat 


taaatgaata 


tgtaaatata 


tttatgataa 


gaagcgactt 


atttataatc 


5460 


attacatatt 


tttctattgg 


aatgattaag 


attccaatag 


aatagtgtat 


aaattattta 


5520 


tcttgaaagg 


agggatgcct 


aaaaacgaag 


aacattaaaa 


acatatattt 


gcaccgtcta 


5580 


atggatttat 


gaaaaatcat 


tttatcagtt 


tgaaaattat 


gtattatgga 


gctctgaaaa 


5640 


y 

aaaggagagg 


ataaaga atg aag aaa < 
Met Lys Lys i 
-190 


ccg ttg ggg 
Pro Leu Gly 


aaa att gtc gca age 
Lys lie val Ala Ser 
-185 


5690 



acc gca eta etc att tet gtt get ttt agt tea teg ate gca teg 5735 

Thr Ala Leu Leu lie ser Val Ala Phe Ser Ser ser lie Ala ser 
-180 -175 -170 

get gee ace gga geg etc cec eag tea ecc ace ccg gag gee gae 5780 

Ala Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp 
-165 -160 -155 

geg gtc tec atg eag gag geg etc eag egc gae etc gac ctg ace 5825 

Ala val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr 
-150 -145 -140 
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tec gcc gag gcc gag gag ctg ctg gcc gcc cag gac acc gcc ttc 
ser Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe 
-135 -130 -1^^ 

m fa^ IIS i?3 1^1 I^a g?3 ??S l?S i?? ?S 111 ^'r l^y 
-120 -115 --LJ-y 

ggc tec gtc ttc gac acc gag age ctg gaa ctg acc gtc ctg gtc acc 
Gly ser_ Val Phe Asp Thr Glu ser Leu Glu Leu Thr Val Leu val Thr 



-105 



tr, ?u Ifi Sif E?S f.^ i?2 Sf ^ ?S 1% IVy fu 



-90 -85 



ctg gtc tec tac ggc ate gac ggt etc gac gag ate gtc cag gag etc 
LeQ val ser Tyr Gly He Asp Gly Leu Asp Glu He Val Gin Glu Leu 

-70 -65 -ou 



aac Dcc acc aac acc gtt cee ggt gtg gtc ggc tgg tac ccg gac gtg 
lln Ala AlS aIp Ala val Pro 6?y vaT val GTy Trp Tyr Pro Asp vat 

-55 -50 



5870 



5915 



5963 



6011 



6059 



6107 



6155 



6203 



6251 



6299 



6347 



6395 



m ?Sf ?af [IS i?2 S3 ifu Ify If? i?? Ill s 

-40 -35 -3" 

ate age ggc ctg etc geg gac gee ggc gtg gac gcc teg gcc gtc gag 
val sir Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu 
-25 -20 -15 

gtg ace aeg age gac cag cee gag etc tac gcc gac ate ate ggt ggt 
Val Thr Thr ser Xsp Gin Pro Glu Leu Tyr Ala Asp He He Gly Gly 
-10 -5 -11 » 

era acc tac ace atg ggc ggc cge tgt teg gtc ggc ttc gcg gcc acc 
tlu Ala Tyr Thr Set GTy cTy Arg cys ser val GTy Phe Ala Ala Thr 

10 15 

aac acc acc aot cag cee ggg ttc gtc ace gee ggt cac tgc ggc cge 
SIS Ala Ala Gly Gin pro gT? Phe val Thr Ala Gty His cys GTy Arg 
25 30 33 

nrn nac acc caa ata ace ate ggc aac ggc agg ggc gtc ttc gag cag 
?3 G% Thr Gin v£T Thr He GTy Asn GTy Arg cTy val Phe Glu Gin 
40 45 50 

?S m ^l 115 ^l ESe f S?5 ify ?S? llr |IS 

55 60 65 

*-fr- arn cta acc aac eta ate age cge tac aac acc ggc ggg tac gcc 
Jhe lS Thr tin tlu val ser A?g T^r Asn Thr cTy GTy T^r Ala 

aca ate ace qgt cac aac cag gee cee ate ggc tee tee gtc tgc cge 6539 
Thr val Ala G?5 His Asn Gin Ala Pro He GTy Ser Ser val cys Arg 

90 95 -LO" 

i?5 ^?? m wi i?5 Ill i ''^i A?^ 1% 

105 110 -^^^ 

i-r-n nrn aor tac ccc oaq QQC acc gtc acc aac atg acc egg acc 
§lg ie? ?3 llr pro il2 cTy Thr val Thr Asn Met Thr Arg Thr 
120 125 130 
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?y J?^ i?2 i?5 ?S IS IJ in l?r ||g 

135 140 

^fl 3^ 1?^ §a? ?S I?5 § IS i?5 g§ ?fi? 

^L5 5 

i?5 i?? ?S S!? i?g ?i| ?S -5 ga «n IS 

170 1/5 -^"^ 

ggc gtc cgt etc egg ace taatcgcatg tteaateege tccataatcg 



6683 



Giy val Arg Leu Arg Thr 
185 



gtcgaegegg 
agcaeeatga 
aetattatca 
atgaeggeaa 
cgeegagagt 
gctetteaag 
eeeaaaaeaa 
gcgggggaac 
cggetgcaaa 
atgcttetge 
aagaaagtga 
attaaettag 
aectatttat 
tgcttattae 
aaectaaagc 
agccettgce 
eactateaat 



tcgcattctt 
tattcgttca 
atttaatact 
ctattatcag 
aaatcggtgc 
tcctgaeagc 
tggacttaae 
tcatcacgec 
cggcgaaaat 



eggttegegt: 
cggeeaaacg 
egateatgan 
attaaaaatu 
cegcttcggg 
ttatcetttt 
tgtcattgac 
ggetttatat 
egggetgagc 
taatgtgaaa 
aaaaeaeaaa 
ttaaggagta 
ggetteagaa 
tgtegetgtg 
ttggeatttg 
aatgggtgca 
tgaggataca 
tattteccgg 



gaaattcgga 
tteaeetgct 
atcattatce 
attettatta 
gatggcggcg 
atggacaage 
gettgtgatt 
cgcaacagaa 



ccggaeagca 
gatgettcea 
aaaagctcca 
acgctgcate 
caagtgeaeg 
agetatgcat 
gtaeegggac 
gaeteeggca 
tcttctgtcg 
teaaatgtta 
gggtgctaae 
gaatggaaaa 
aaagaegeag 
ggactaatca 
tttgcgattt 
attgcaattt 
ttaageggat 
ggatttatca 
aaaaaaaeee 



eatcaecgaa 
acggtgctaa 
tttteggatc 
ataaeegcta 
tatacaacaa 



atattatgga 
ctatatcacg 
aagtgacagc 
taaaaatatt 



attccaagta 
gaaacatteg 
aaaaeeggtt 
aacecgetga 
tgggeaattg 
tacaaactgt 



ggggaatcgg 
tgtcagctgc 
cgttgetgaa 
gctggaegce 
taaateaagc 
ctttgtgttt 
ggggategga 
gaaaacagtc 
tctggtttat 
ttgtegeaae 
ttgcattggc 
tegggaataa 
aaaccggtct 
ttggacttte 
ataeggegcg 



ctattatgaa 
aaagtcatet 
taaaacgatc 



cggcacacag 
gtctctgcat 
gggtgcgggt 
tttaattaat 
aaacaagtat 
agcagtaaag 
tcccgctccg 
aattatcgge 



agaaaatate 
atgtectaca 
aaaacctecg 
gteeagcgcg 
ggaageacaa 
aaaatetatg 
agcgtattca 
atcaaegeat 
ggatcgattg 
aaattaaatt 
taaaatgttt 
ataggaggag 
cttgttccat 
tecggaettg 
tttatcteca 



gateaageee 
tteaggggaa 
ttgceaagct 
ccgcgattgt 
atccgccgga 



ggttactgca 

gaceatttgg 

eggtgegaga 

ttattcactg 

tgcaggaggc 

ggcaaatgga 

tctgateaea 

ggcccatgat 

acegggaett 

aatcaaagaa 



6731 



aaactgaaag aaatgggacc gttcaaaaaa 
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ctaaetggaa 

cttatcgtta 

atttcgtatg 

ctattcagtg 

attatatttc 

aeagagagaa 

tetgctatgt 

gtcgcagggg 

gtaagcttaa 

acaecggatg 

tcggagcttt 



6779 



6827 



6887 

6947 

7007 

7067 

7127 

7187 

7247 

7307 

7367 

7427 

7487 

7547 

7607 

7667 

7727 

7787 

7847 

7907 

7967 

8027 

8087 

8147 

8207 

8267 

8327 

8387 
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ccatggttat cgtgtttctt ttggtgcttg tgctgtggat ttttggcggc agcttcaaca 8447 

tcgacgctac cacaaccgca ttgatcggtt tggccgttct cttattatca caagttctga 8507 

cttgggatga tatcaagaaa gaacagggcg ctngggatac gctcacttgg tttgcggcgc 8567 

ttgtcatgct cgccaacttc ttgaatgaat taggcatggt gtcttggttc agtaatgcca 8627 

tgaaatcatc cgtatcaggg ttctcttgga ttgtggcatt catcatttta attgttgtgt 8687 

attattactc tcactatttc tttgcaagtg cgacagccca catcagtgcg atgtattcag 8747 

catttttggc tgtcgtcgtg gcagcgggcg caccgccgct tttagcagcg ctgagcctcg 8807 

cgttcatcag caacctgttc gggtcaacga ctcactacgg ttctggagcg gctccggtct 8867 

tcttcggagc aggctacatc ccgcaaggca aatggtggtc catcggattt atcctgtcga 8927 

ttgttcatat catcgtatgg cttgtgatcg gcggattatg gtggaaagta ctaggaatat 8987 

ggtagaaaga aaaaggcaga cgcggtctgc ctttttttat tttcactcct tcgtaagaaa 9047 

atggattttg aaaaatgaga aaattccctg tgaaaaatgg tatgatctag gtagaaagga 9107 

cggctggtgc tgtggtgaaa aagcggttcc atttttccct gcaaacaaaa ataatggggc 9167 

tgattgcggc tctgctggtc tttgtcattg gtgtgctgac cattacgtta gccgttcagc 9227 

atacacaggg agaacggaga caggcagagc agctggcggt tcaaacggcg agaaccattt 9287 

cctatatgcc gccggttaaa gagctcattg agagaaaaga cggacatgcg gctcagacgc 9347 

aagaggtcat tgaacaaatg aaagaacaga ctggtgcgtt tgccatttat gttttgaacg 9407 

aaaaaggaga cattcgcagc gcctctggaa aaagcggatt aaagaaactg gagcgcagca 9467 

gagaaatttt gtttggcggt tcgcatgttt ctgaaacaaa agcggatgga cgaagagtga 9527 

tcagagggag cgcgccgatt ataaaagaac agaagggata cagccaagtg atcggcagcg 9587 

tgtctgttga ttttctgcaa acggagacag agcaaagcat caaaaagcat ttgagaaatt 9647 

tgagtgtgat tgctgtgctt gtactgctgc tcggatttat tggcgccgcc gtgctggcga 9707 

aaagcatcag aaaggatacg ctcgggcttg aaccgcatga gatcgcggct ctatatcgtg 9767 

agaggaacgc aatgcttttc gcgattcgag aagggattat tgccaccaat cgtgaaggcg 9827 

tcgtcaccat gatgaacgta tcggcggccg agatgctgaa gctgcccgag cctgtgatcc 9887 

atcttcctat agatgacgtc atgccgggag cagggctgat gtctgtgctt gaaaaaggag 9947 

aaatgctgcc gaaccaggaa gtaagcgtca acgatcaagt gtttattatc aatacgaaag 10007 

tgatgaatca aggcgggcag gcgtatggga ttgtcgtcag cttcagggag aaaacagagc 10067 

tgaagaagct gatcgacaca ttgacagagg ttcgcaaata ttcagaggat ctcagggcgc 10127 

agactcatga attttcaaat aagctttatg cgattttagg gctgc 10172 

<210> 14 
<211> 380 
<212> PRT 
<213> Artificial 

<220> 

<223> synthetic construct 

Page 23 



wo 2005/035747 PCT/DK2004/000688 

10508.204-WO.ST25 

<400> 14 

Met Lys Lys Pro Leu Gly Lys lie Val Ala ser Thr Ala Leu Leu 
-190 -185 -180 

lie ser val Ala Phe Ser ser ser lie Ala ser Ala Ala Thr Gly 
-175 -170 -165 

Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala val ser Met 
-160 -155 -150 

Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu Ala 
-145 -140 -135 

Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 

Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser val Phe 
-115 -110 -105 

ASD Thr Glu Ser Leu Glu Leu Thr val Leu val Thr Asp Ala Ala Ala 
-100 -95 -90 

val GlLr*Air^al^GTu''A^^ Ala Gly Thr Glu Leu Val ser Tyr 

-85 -80 -75 

Gly lie Asp Gly Leu Asp Glu He Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 

Ala val Pro Gly val val Gly Trp Tyr Pro Asp val Ala Gly Asp Thr 

-50 -45 -40 

val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val ser Gly Leu 

-35 -30 -25 

Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu Val Thr Thr Ser 
-20 -15 -10 

Asp Gin Pro Glu Leu Tyr Ala Asp lie He Gly Gly Leu Ala Tyr Thr 
-5 -11 5 10 

Met Gly Gly Arg cys Ser val Gly Phe Ala Ala Thr Asn Ala Ala Gly 

15 20 25 

Gin pro Gly Phe Val Thr Ala Gly His cys Gly Arg val Gly Thr Gin 

Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser Val Phe Pro 
45 50 55 

Gly Asn Asp Ala Ala Phe Val Arg Gly Thr ser Asn Phe Thr Leu Thr 

Page 24 



wo 2005/035747 



PCT/DK2004/000688 



10508. 204-WO-ST25 
60 65 70 

Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr val Ala Gly 
75 80 85 90 

His Asn Gin Ala Pro lie Gly ser ser Val Cys Arg Ser Gly ser Thr 

95 100 105 

Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin ser val Ser 

110 115 120 

Tvr Pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr Val cys Ala 
125 130 135 

Glu pro Gly Asp Ser Gly Gly Ser Tyr He ser Gly Thr Gin Ala Gin 
140 145 150 

Gly val Thr ser Gly Gly ser Gly Asn cys Arg Thr Gly Gly Thr Thr 
155 160 165 170 

Phe Tyr Gin Glu val Thr Pro Met val Asn ser Trp Gly Val Arg Leu 

175 180 185 

^Ar^Thr ^ 



<210> 15 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 15 , c 

ggagctctga aaaaaaggag aggataaaga atgaa 

<210> 16 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 16 

gcgttccgat aatcgcggtg acaatgccg 

<210> 17 

<211> 29 

<212> DNA 

<213> Arti f i ci al 
<220> 

<223> Primer 

<400> 17 

Page 25 



wo 2005/035747 PCT/DK2004/000688 

10508.204-WO.ST25 

ttcatgagtc tgcgccctga gatcctctg 

<210> 18 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 18 30 
taatcgcatg ttcaatccgc tccataatcg 

<210> 19 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<400> 19 

cccaacggtt tcttcattct ttatcctctc ctttttttca gage 



<210> 20 
<211> 1164 
<212> DNA 



<220> 

<223> Protease 22 



<220> 

<221> CDS 

<222> (1) . . (1164) 

<220> 

<221> sig^peptide 

<222> (l5 - . (81) 

<220> 

<2 2 1> mat_pept i de 

<222> (577) . . (1164) 



44 



45 



90 



135 



atq^aaa^aaa ccg ctg gga aaa att gtc gca age aca gca ctt ctt 
Met Lvs Lys pro LeG Gly Lys He Val Ala ser Thr Ala Leu Leu 
-190 -185 -lo^ 

att tea qtg gca ttt age tea tct att gca tea gca get aca gga 
lie ser val Ala Phe sir ser ser He Ala ser Ala Ala Thr cTy 
-175 -170 -165 

gca tta ccg eag tct ccg aca ccg gaa gca gat gca gtc tea atg 
Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala val ser Met 
-160 -155 -150 

caa aaa gca ctg caa aga gat ctt gat ctt aca tea gca gaa gca 
Gin Glu Ala LeG Gin Arg Asp Leu Asp Leu Thr ser Ala Glu Ala 
-145 -140 -135 

gaa gaa ctt ctt get gca caa gat aca gca ttt gaa gtg gat gaa 225 
Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 
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270 



318 



366 



414 



462 



510 



558 



654 



gca gcg gca gaa gca gca gga gat gca tat ggc ggc tea gtt ttt 
Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser val Phe 
-115 -110 -105 

gat aca gaa tea ctt gaa ctt aca gtt ctt gtt aca gat gca gca gca 
ASD Thr Glu ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
^ -100 -95 -90 

gtt gaa gca gtt gaa gca aca gga gca gga aca gta ctt gtt tea tat 
val Glu Ala val Glu Ala Thr Gly Ala Gly Thr val Leu val Ser Tyr 
-85 -80 -75 

gga att gat ggc ctt gat gaa att gtt caa gaa ctg aat gca get gat 
Gly lie Asp Gly Leu Asp Glu lie val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 

get gtt ceg ggc gtt gtt ggc tgg tat ceg gat gtt get gga gat aca 
Ala val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 

-50 -45 -40 

gtt gte ctt gaa gtt ctt gaa gga tea ggc gca gat gtt tea ggc ctg 
val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val ser Gly Leu 

-35 -30 -25 

ctq qea gae gca gga gte gat gca tea gca gtt gaa gtt aca aca tea 
Leu Ala Asp Ala GTy val Asp Ala ser Ala val Glu Val Thr Thr ser 
-20 -15 -10 

gat caa ceg gaa ctt tat gca gat att att ggc ggc ctg gca tat tat 606 
Jsp-«lT*U2ra-<eajJ-=bau.^¥«»^^ 

atq QQC ggc aga tgc age gtt ggc ttt gca gca aca aat gca tea ggc 
Met GTy GTy Arg cys Ser Val cTy Phe Ala Ala Thr Asn Ala ser GTy 

15 20 25 

caa ceg ggc ttt gtt aca gca ggc cat tgc ggc aca gtt ggc aca cca 
Gin Pro Gly Phe val Thr Ala Gly His cys Gly Thr val Gly Thr Pro 

30 35 40 

att tea att ggc aat ggc aaa ggc gtt ttt gaa cga age att ttt ceg 
vJl ser lie cTy Asn GTy Lys cTy val Phe Glu Arg Ser He Phe Pro 
45 50 55 

ggc aat gat tea gca ttt gtt aga ggc aca tea aat ttt aca ctt aca 
Gly Asn Asp ser Ala Phe val Arg GTy Thr ser Asn Phe Thr Leu Thr 
60 65 70 

aat ctg gtt tea aga tat aat tea ggc ggc tat gca aca gtt gca ggc 
Asn Leu val Ser Arg Tyr Asn ser GTy GTy Tyr Ala Thr Val Ala Gly 
75 80 85 90 

cat aat caa gca ceg att ggc tea gca gtt tgc aga tea ggc tea aca 
His Asn Gin Ala Pro He Gly Ser Ala val cys Arg ser Gly ser Thr 

95 100 105 

aca qge tgg cat tgc ggc aca att caa gca aga aat caa aca gtt agg 
Thr Gly Trp His cys Gly Thr He Gin Ala Arg Asn Gin Thr Val Arg 

tat ceg caa ggc aca gtt tat agt ctg aca aga aca aca gtt tgt gca 
Tvr pro Gin Gly Thr val Tyr ser Leu Thr Arg Thr Thr val cys Ala 
125 130 135 

gaa ceg ggc gat tea ggc ggc tea tat att age ggc act caa gca caa 
Glu Pro Gly Asp ser Gly Gly ser Tyr He ser Gly Thr Gin Ala Gin 
140 145 150 
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846 



894 



942 



990 
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ggc gtt aca tea ggc ggc tea ggc aat tgc agt get ggc ggc aca aca 

Gly val Thr ser Gly Gly Ser Gly Asn Cys ser Ala Q\y Gly Thr Thr 

155 160 165 170 

tat tac caa gaa gtt aat ccg atg ctt agt tea tgg ggc ctt aca ctt 

Tyr Tyr Gin Glu val Asn Pro Met Leu ser ser Trp Gly Leu Thr Leu 

175 180 185 



1086 



Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala val ser Met 
-160 -155 -150 

Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu Ala 
-145 -140 -135 

Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 

Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser val Phe 
-115 -110 -105 

Asp Thr Glu ser Leu Glu Leu Thr val Leu Val Thr Asp Ala Ala Ala 
-100 -95 -90 

Val Glu Ala val Glu Ala Thr Gly Ala Gly Thr val Leu val Ser Tyr 
-85 -80 -75 

Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 

Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp val Ala Gly Asp Thr 

-50 -45 -40 

val val Leu Glu Val Leu Glu Gly ser Gly Ala Asp val ser Gly Leu 

-35 -30 -25 



1134 



aga aca caa teg cat gtt caa tec get cea 1164 
Arg Thr Gin ser His val Gin ser Ala Pro 

190 195 

<210> 21 
<211> 388 
<212> PRT 
<213> Artificial 

<220> 

<223> synthetic construct 
<400> 21 

Met Lys Lys Pro Leu Gly Lys lie val Ala Ser Thr Ala Leu Leu 
-190 -185 -180 

lie Ser Val Ala Phe ser Ser ser He Ala ser Ala Ala Thr Gly 
-175 -170 -165 
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Leu Ala Asp Ala Gly Val Asp Ala Ser Ala val Glu Val Thr Thr Ser 
-20 -15 -10 

Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala Tyr Tyr 
-5 -11 5 10 

Met Gly Gly Arg cys Ser Val Gly Phe Ala Ala Thr Asn Ala ser Gly 

15 20 25 

Gin pro Gly Phe val Thr Ala Gly His Cys Gly Thr val Gly Thr Pro 

30 35 40 

Val Ser lie Gly Asn Gly Lys Gly val Phe Glu Arg ser lie Phe Pro 
45 50 55 

Gly Asn Asp Ser Ala Phe Val Arg Gly Thr ser Asn Phe Thr Leu Thr 
60 65 70 

Asn Leu Val ser Arg Tyr Asn ser Gly Gly Tyr Ala Thr val Ala Gly 
75 80 85 90 

His Asn Gin Ala Pro lie Gly ser Ala val Cys Arg ser Gly ser Thr 

95 100 105 

Thr Gly Trp His cys Gly Thr lie Gin Ala Arg Asn Gin Thr val Arg 

110 115 120 

Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr val cys Ala 
125 130 135 

Glu Pro Gly Asp Ser Gly Gly ser Tyr lie Ser Gly Thr Gin Ala Gin 
140 145 150 

Gly Val Thr ser Gly Gly ser Gly Asn cys ser Ala Gly Gly Thr Thr 
155 160 165 170 

Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser ser Trp Gly Leu Thr Leu 

175 180 185 

Arg Thr Gin ser His Val Gin ser Ala Pro 

190 195 



Page 29 



